DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
نویسندگان
چکیده
Abstract Semantic segmentation in cataract surgery has a wide range of applications contributing to surgical outcome enhancement and clinical risk reduction. However, the varying issues segmenting different relevant structures these surgeries make designation unique network quite challenging. This paper proposes semantic network, termed DeepPyramid, that can deal with challenges using three novelties: (1) Pyramid View Fusion module which provides varying-angle global view surrounding region centering at each pixel position input convolutional feature map; (2) Deformable Reception enables deformable receptive field adapt geometric transformations object interest; (3) dedicated Loss adaptively supervises multi-scale maps. Combined, we show modules effectively boost performance, especially case transparency, deformability, scalability, blunt edges objects. We demonstrate our approach performs state-of-the-art level outperforms number existing methods large margin (\(3.66\%\) overall improvement intersection over union compared best rival approach).KeywordsCataract surgerySemantic segmentationSurgical data science
منابع مشابه
Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation
CNN architectures have terrific recognition performance but rely on spatial pooling which makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper makes two contributions: (1) We demonstrate that while the apparent spatial resolution of convolutional feature maps is low, the high-dimensional feature representation contains significant sub-pixel localizat...
متن کاملESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
We introduce a fast and efficient convolutional neural network, ESPNet, for semantic segmentation of high resolution images under resource constraints. ESPNet is based on a new convolutional module, efficient spatial pyramid (ESP), which is efficient in terms of computation, memory, and power. ESPNet is 22 times faster (on a standard GPU) and 180 times smaller than the stateof-the-art semantic ...
متن کاملSegmentation Pyramid Classification
Ben Gorte Dept. of Geoinformatics ITC the Netherlands [email protected] Commission III, Working Group 3
متن کاملPyramid segmentation algorithms revisited
The main goal of this work is to compare pyramidal structures proposed to solve segmentation tasks. Segmentation algorithms based on regular and irregular pyramids are described, together with the data structures and decimation procedures which encode and manage the information in the pyramid. In order to compare the different segmentation algorithms, we have employed three types of quality mea...
متن کاملVolumetric Semantic Segmentation using Pyramid Context Features Supplemental Material
Our “pyramid filtering” insight enables exact and extremely efficient per-voxel classification with minimal memory overhead. We will now demonstrate our improvement over existing techniques empirically and theoretically. For an empirical demonstration of efficiency, consider the following two alternatives to pyramid filtering: 1) the “sliding window” approach: iterate through every voxel in the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-16443-9_27